̽»¨ÊÓÆµ

̽»¨ÊÓÆµ, in conjunction with its vendor partners, sponsors hundreds of events each year, ranging from webcasts and tradeshows to executive roundtables and technology forums.

Events and Resources

Events

Runai-microsite.png
Run:ai

AI Inference Workloads: Overcoming Challenges of taking AI to Production


Event Date: May 24, 2022
Hosted By: Run.AI & ̽»¨ÊÓÆµ

AI and MLOps engineers in federal agencies often struggle to deploy models on GPUs. Most Al research initiatives never make it to production. Why? Researchers are facing bottlenecks due to static allocations of GPUs, and different technology sets complicate moving models from training to production.

Join Run.AI and ̽»¨ÊÓÆµ to learn̽»¨ÊÓÆµ to learn how your agency can overcome the challenges associated with new hardware-accelerated AI modeling practices and discover how traditional best practices have evolved to become more efficient

During this live session you will learn from our experts how to:

  • Run multiple inference workloads on the same GPU by using the concept of fractional GPUs
  • Remove the bottlenecks which prevent almost 80% of workflows from reaching production
  • Get dynamic MIG slices for each new job when using the NVIDIA A100 GPU
  • Improve GPU utilization when running inference workloads
  • Maintain high throughput and low latency for model serving


Fill out the form below to view this archived event.


Resources


No resources were found. Please try another search.